The robustness of speech representations obtained from simulated auditory nerve fibers under different noise conditions.
نویسندگان
چکیده
Different methods of extracting speech features from an auditory model were systematically investigated in terms of their robustness to different noises. The methods either computed the average firing rate within frequency channels (spectral features) or inter-spike-intervals (timing features) from the simulated auditory nerve response. When used as the front-end for an automatic speech recognizer, timing features outperformed spectral features in Gaussian noise. However, this advantage was lost in babble, because timing features extracted the spectro-temporal structure of babble noise, which is similar to the target speaker. This suggests that different feature extraction methods are optimal depending on the background noise.
منابع مشابه
A Noise Suppression Technique using an Auditory Model
In this paper we describe an efficient speech analysis model based on the properties of the peripheral auditory system. This model uses a bank of gamma-tone filters followed by a model of adaptation that occurs in auditory nerve fibers. The model produces a speech representation in terms of the mean firing rate. A noise suppression mechanism is included in order to obtain higher speech recognit...
متن کاملA Robust Speaker Identification System Using the Responses from a Model of the Auditory Periphery
Speaker identification under noisy conditions is one of the challenging topics in the field of speech processing applications. Motivated by the fact that the neural responses are robust against noise, this paper proposes a new speaker identification system using 2-D neurograms constructed from the responses of a physiologically-based computational model of the auditory periphery. The responses ...
متن کاملNoise-Robust Speech Recognition Through Auditory Feature Detection and Spike Sequence Decoding
Speech recognition in noisy conditions is a major challenge for computer systems, but the human brain performs it routinely and accurately. Automatic speech recognition (ASR) systems that are inspired by neuroscience can potentially bridge the performance gap between humans and machines. We present a system for noise-robust isolated word recognition that works by decoding sequences of spikes fr...
متن کاملThe representation of noise vocoded speech in the auditory nerve of the chinchilla: physiological correlates of the perception of spectrally reduced speech.
This study investigated the neural representation of naturally produced and noise vocoded speech signals in the auditory nerve of the chinchilla. The syllables [see text] produced by male speakers were used to synthesize noise vocoded speech stimuli containing one, two, three and four bands of envelope modulated noise. The ensemble response of the auditory nerve, computed by pooling the PST his...
متن کاملEffect of Vowel Auditory Training on the Speech-In-Noise Perception among Older Adults with Normal Hearing
Introduction: Aging reduces the ability to understand speech in noise. Hearing rehabilitation is one of the ways to help older people communicate effectively. This study aimed to investigate the effect of vowel auditory training on the improvement of speech-in-noise (SIN) perception among elderly listeners. Materials and Methods: This study was conducted on 36 elderly ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- The Journal of the Acoustical Society of America
دوره 134 3 شماره
صفحات -
تاریخ انتشار 2013